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(57) Abstract: A camera-based touch system <50) includes a passive touch surface {60) and at least two cameras (63) associated 
with the touch surface. The at least two cameras (63) have overlapping fields of view (FOV) encompassing the touch surface. The 
at least two cameras (63) acquire images of the touch surface from different locations and generate image data. A processor (54) 
receives and processes image data generated by the at least two cameras to determine the location of the pointer relative to the touch 
surface when the pointer is captured in images acquired by the at least two cameras. Actual pointer contact with the touch surface 
and pointer hover above the touch surface can be determined. 
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CAMERA-BASED TOUCH SYSTEM 

TECHNICAL FIELD 

The present invention relates generally to touch systems and in 
particular to a camera-based touch system. 

5 BACKGROUND ART 

Touch systems are well known in the art and typically include a touch 
screen having a touch surface on which contacts are made using a pointer in order to 
generate user input. Pointer contacts with the touch surface are detected and are used 
to generate corresponding output depending on areas of the touch surface where the 
10 contacts are made. There are basically two general types of touch systems available 
and they can be broadly classified as "active" touch systems and "passive" touch 
systems. 

Active touch systems allow a user to generate user input by contacting 

the touch surface with a special pointer that usually requires some form of on-board 
15 power source, typically batteries. The special pointer emits signals such as infrared 

light, visible light, ultrasonic frequencies, electromagnetic frequencies, etc. that 

activate the touch surface. 

Passive touch systems allow a user to generate user input by contacting 

the touch surface with a passive pointer and do not require the use of a special pointer 
20 in order to activate the touch surface. A passive pointer can be a finger, a cylinder of 

some material, or any suitable object that can be used to contact some predetermined 

area of interest on the touch surface. 

Passive touch systems provide advantages over active touch systems in 

that any suitable pointing device, including a user's finger, can be used as a pointer to 
25 contact the touch surface. As a result, user input can easily be generated. Also, since 

special active pointers are not necessary in passive touch systems, battery power 

levels and/or pointer damage, theft, or pointer misplacement are of no concern to 

users. 

Passive touch systems have a number of applications relating to 
30 computer operation and video display. For example, in one interactive application, as 
is disclosed in U.S. Patent No. 5,448,263 to Martin, assigned to the assignee of the 
present invention, a passive touch system is coupled to a computer and the computer 
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display is presented on the touch surface of the touch screen. The coordinates 
representing specific locations on the touch surface are mapped to the computer 
display. When a user contacts the touch surface, the coordinates of the contact 
position are fed back to the computer and mapped to the computer display thereby 

5 allowing the user to operate the computer in a manner similar to using a computer 
mouse simply by contacting the touch surface. Furthermore, the coordinates fed back 
to the computer can be recorded in an application and redisplayed at a later time. 
Recording contact coordinates is typically done when it is desired to record 
information written or drawn on the touch surface by the user. 

10 The resolution of a passive touch screen determines if the touch system 

is suitable for recording information written or drawn on the touch screen or only 
useful for selecting areas on the touch screen mapped to regions on the computer or 
video display in order to manipulate the computer or video display. Resolution is 
typically measured in dots per inch (DPI). The DPI is related to the size of the touch 

15 screen and the sampling ability of the touch system hardware and software used to 
detect contacts on the touch surface. 

Low-resolution passive touch screens only have enough DPI to detect 
contacts on the touch surface within a large group of pixels displayed by the computer 
or video display. Therefore, these low-resolution passive touch screens are useful 

20 only for manipulating the computer or video display. 

On the other hand, high-resolution passive touch screens have 
. sufficient DPI to detect contacts that are proportional to a small number of pixels or 
sub-pixels of the computer or video display. However, a requirement for high- 
resolution touch screens is the ability to detect when the pointer is in contact with the 

25 touch surface. This is necessary for writing, drawing, mouse-click operations, etc. v 
Without the ability to detect pointer contact with the touch screen, writing and 
drawing would be one continuos operation, and mouse clicks would not be possible 
thereby making computer display manipulation virtually impossible. A secondary 
requirement is the ability to detect when the pointer is "hovering" above the touch 

30 surface. Although not required for writing or drawing, today ' s computer operating 
systems are increasingly using hover information to manipulate computer or video 
displays or pop-up information boxes. 
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Passive touch screens are typically either of the analog resistive type, 
surface acoustic wave (SAW) type or capacitive type. Unfortunately, these touch 
screens suffer from a number of problems or shortcomings as will be described. 

Analog resistive touch screens typically have a high-resolution. 
5 Depending on the complexity of the touch system, the resolution of the touch screen 
can produce 4096x4096 DPI or higher. Analog resistive touch screens are 
constructed using two flexible sheets that are coated with a resistive material and 
arranged as a sandwich. The sheets do not come into contact with each other until a 
contact has been made. The sheets are typically kept separated by insulating 

10 microdots or by an insulating air space. The sheets are constructed from ITO, which 
is mostly transparent. Thus, the touch screen introduces some image distortion but 
very little parallax. 

During operation of an analog resistive passive touch screen, a uniform 
voltage gradient is applied in one direction along a first of the sheets. The second 

1 5 sheet measures the voltage along the first sheet when the two sheets contact one 

another as a result of a contact made on the touch surface. Since the voltage gradient 
of the first sheet can be translated to the distance along the first sheet, the measured 
voltage is proportional to the position of the contact on the touch surface. When a 
contact coordinate on the first sheet is acquired, the uniform voltage gradient is then 

20 applied to the second sheet and the first sheet measures the voltage along the second 
sheet. The voltage gradient of the second sheeit is proportional to the distance along 
the second sheet. These two contact coordinates represent the X-Y position of the 
contact on the touch surface in a Cartesian coordinate system. 

Unfortunately, since mechanical pressure is required to bring both 

25 sheets into contact, analog resistive touch screens can only detect contact when there 
is sufficient pressure to bring the two sheets together. Analog resistive passive touch 
screens also cannot sense when a pointer is hovering over the touch surface. 
Therefore, in the case of analog resistive touch screens contact events and positions 
can only be detected when actual contacts are made with the touch surface. 

30 Surface acoustic wave (SAW) touch screens typically provide for 

medium resolution and are not suitable for recording good quality writing. SAW 
touch screens employ transducers on the borders of a glass surface to vibrate the glass 
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and produce acoustic waves that ripple over the glass surface. When a contact is 
made on the glass surface, the acoustic waves reflect back and the contact position is 
determined from the signature of the reflected acoustic waves. 

Unfortunately, SAW touch screens exhibit noticeable parallax due to 
5 the thickness of the vibrating glass that is placed over the surface of the video or 
computer display. Also, contact events and positions can only be detected when 
actual contacts are made with the glass surface. Furthermore, SAW touch screens do 
not scale beyond a few feet diagonal. 

Capacitive touch screens provide for low resolution because contacts 
10 can only be determined in large areas (approximately V£"x 14"). As a result, capacitive 
touch screens cannot be used for recording writing or drawing but are suitable for 
selecting areas on the touch screen corresponding to computer generated buttons . 
. displayed on the video or computer display. Capacitive touch screens also suffer 
disadvantages in that they are sensitive to temperature and humidity. Similar to 
1 5 analog resistive touch screens and SAW touch screens, capacitive touch screens can 
also only detect contact events and positions when actual contacts are made with the 
touch surface. 

Scalability of passive touch screens is important since the demand for 
larger electronic digitizers is increasing. Where digitizers were once small desktop 
20 appliances, today they have found there way onto electronic whiteboarding 
applications. The need to build a passive touch sensitive "wall" has become a 
requirement for new touch screen applications. Existing passive touch screens of the 
types discussed above are all limited in the maximum size where they are still 
functional. 

25 As will be appreciated, improvements to passive touch systems are 

desired. It is therefore an object of the present invention to provide a novel camera- 
based touch system. 

DISCLOSURE OF THE INVENTION 

30 According to one aspect of the present invention there is provided a 

camera-based touch system comprising; 
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at least two cameras associated with a passive touch surface and 
having overlapping fields of view encompassing said touch surface, said at least two 
cameras acquiring images of said touch surface from different locations and 
generating image data; and 
5 a processor receiving and processing image data generated by said at 

least two cameras to determine the location of a pointer relative to said touch surface 
when said pointer is captured in images acquired by said at least two cameras. 

Preferably the at least two cameras are digital cameras having fields of 
view looking generally along the plane of the touch surface. The image data 

10 generated by each digital camera includes a pointer median line x and a pointer tip 
location z. Each of the digital cameras includes a pixel array having selectable pixel 
rows. Pixel intensities of pixels in the selectable pixel rows are used during 
generation of the image data. Preferably, pixel intensities of pixels in a region of 
interest within the selectable pixel rows are used during generation of the image data. 

15 In a preferred embodiment, each of the digital cameras includes a 

CMOS image sensor and a digital signal processor. The digital signal processor 
receives image output from the image sensor and executes a find pointer routine to 
determine if a pointer is in each image acquired by the digital camera and if so, the 
median line of the pointer. It is also preferred that the digital signal processor of each 

20 digital camera executes an update background image routine to update the 
background image after each image is acquired. Preferably, the digital signal 
processor of each digital camera further determines the differences between each 
acquired image and the background image to detect changing light conditions. 

According to another aspect of the present invention there is provided 

25 a camera-based touch system comprising: 

a generally rectangular passive touch surface on which contacts are 
made using a pointer; 

a digital camera mounted adjacent each comer of said touch surface, 
said digital cameras having overlapping fields of view encompassing said touch 

30 surface, said digital cameras acquiring images of said touch surface and generating 
image data that includes the median line x and pointer tip location z of a pointer when 
said pointer is captured in images acquired by said digital cameras; and 
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a processor receiving and processing image data generated by said 
digital cameras to determine the location of said pointer relative to said touch surface 
and whether said pointer is in contact with said touch surface. 

According to yet another aspect of the present invention there is 
5 provided a method of detecting the position of a pointer relative to a touch surface 
comprising the steps of: 

acquiring images of said touch surface from different locations using 
cameras having overlapping fields of view and generating image data; and 

processing said image data to detect the existence of a pointer within 
10 said acquired images and to determine the location of said pointer relative to said 
touch surface. 

The present invention provides advantages in that the passive touch 
system is of high resolution and allows actual pointer contacts with the touch surface 
as well as pointer hovers above the touch surface to be detected and corresponding 
1 5 output generated. Also, the present passive touch system provides advantages in that 
it does not suffer from parallax, image distortion, pointer position restrictions, image 
projection and scalability problems that are associated with prior art passive touch 
systems. 

Furthermore, the present invention provides advantages in that since 
20 CMOS digital cameras are used, arbitrary pixel rows in the digital camera pixel arrays 
can be selected. This enables the frame rates of the digital cameras to be increased 
significantly. Also, since the pixel rows can be arbitrary selected, the pixel arrays can 
be exposed for greater durations for given digital camera frame rates allowing for 
good operation in dark rooms as well as well lit rooms. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Embodiments of the present invention will now be described more 
fully with reference to the accompanying drawings in which: 

Figure 1 is a schematic diagram of a camera-based touch system in 
30 accordance with the present invention; 

Figure 2 is an isometric view of a touch screen forming part of the 
touch system of Figure 1 ; 
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Figure 3 is an isometric view of a corner portion of the touch screen of 

Figure 2; 

Figure 4 is a schematic diagram of a digital camera forming part of the 
touch screen of Figure 2; 
5 Figure 5 is a schematic diagram of a master controller forming part of 

the touch system of Figure 1 ; 

Figure 6 is a flowchart showing the steps performed during execution 
of a processFrame routine; 

Figure 7 is a flowchart showing the steps performed during execution 
10 of a segmentPointer routine; 

Figure 8 is a flowchart showing the steps performed during execution 
of a findPointer routine; 

Figure 9 shows an image acquired by a digital camera and a pixel 
subset of the image that is processed; 
1 5 Figure 1 0 shows a region of interest (ROI) within the pixel subset of 

Figure 9; 

Figure 1 1 shows triangulation geometry used to calculate a pointer 
contact position on the touch surface of the touch screen illustrated in Figure 2; 

Figure 12 shows an image acquired by a digital camera including the 
20 pointer tip and its median line; 

Figure 13 shows pointer contact and pointer hover for different 
orientations of the pointer; 

Figure 14 is an image of the touch surface of the touch screen as seen 
by a digital camera; 

25 Figures 15 and 16 show the results of a Matlab simulation of pointer 

tracking using a Kalman filter; and 

Figures 17a to 17d show the results of another Matlab simulation of 
pointer tracking using a Kalmian filter. 



30 



BEST MODE FOR CARRYING OUT THE INVENTION 

Turning now to Figure 1, a camera-based touch system in accordance 
with the present invention is shown and is generally identified by reference numeral 
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50. As can be seen, touch system 50 includes a touch screen 52 coupled to a digital 
signal processor (DSP) based master controller 54. Master controller 54 is also 
coupled to a computer 56. Computer 56 executes one or more application programs 
and provides display output that is presented on the touch screen 52 via a projector 58. 
5 The touch screen 52, master controller 54, computer 56 and projector 58 form a 
closed-loop so that user contacts with the touch screen 52 can be recorded as writing 
or drawing or used to control execution of application progranis executed by the 
computer 56. 

Figures 2 to 4 better illustrate the touch screen 52. Touch screen 52 

10 includes a touch surface 60 bordered by a rectangular frame 62. Touch surface 60 is 
in the form of a rectangular planar sheet of passive material. DSP-based CMOS 
digital cameras 63 are associated with each corner of the touch screen 52. Each 
digital camera 63 is mounted on a frame assembly 64. Each frame assembly 64 
includes an angled support plate 66 on which the digital camera 63 is mounted. 

15 Supporting frame elements 70 and 72 are mounted on the plate 66 by way of posts 74 
and secure the plate 66 to the frame 62. 

Each digital camera 63 includes a two-dimensional CMOS image 
sensor and associated lens assembly 80, a first-in-first-out (FIFO) buffer 82 coupled 
to the image sensor and lens assembly 80 by a data bus and a digital signal processor 

20 (DSP) 84 coupled to the FIFO 82 by a data bus and to the image sensor and lens 

assembly 80 by a control bus. A boot EPROM 86 and a power supply subsystem 88 
are also included. 

In the present embodiment, the CMOS camera image sensor is a 
Photobit PB300 image sensor configured for a 20x640 pixel subarray that can be 

25 operated to capture image frames at rates in excess of 200 frames per second since . 
arbitrary pixel rows can be selected. Also, since the pixel rows can be arbitrarily 
selected, the pixel subarray can be exposed for a greater duration for a given digital 
camera frame rate allowing for good operation in dark rooms as well as well lit 
rooms. The FIFO buffer 82 is manufactured by Cypress under part number 

30 CY7C421 IV and the DSP 84 is manufactured by Analog Devices under part number 
ADSP2185M. 
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The DSP 84 provides control information to the image sensor and lens 
assembly 80 via the control bus. The control information allows the DSP 84 to 
control parameters of the image sensor and lens assembly 80 such as exposure, gain, 
. array configuration, reset and initialization. The DSP 84 also provides clock signals 
5 to the image sensor and lens assembly 80 to control the frame rate of the image sensor 
and lens assembly 80. 

The angle of the plate 66 is selected so that the field of view (FOV) of 
each digital camera 63 extends beyond a designated peripheral edge of the touch 
surfece 60 as shown in Figure 1 1 . In this way, the entire touch surface 60 is within 
10 the fields of view of the digital cameras 63 . 

Master controller 54 is best illustrated in Figure 5 and includes a DSP 
90, a boot EPROM 92, a serial line driver 94 and a power supply subsystem 95. The 
DSP 90 communicates with the DSPs 84 of the digital cameras 63 over a data bus via 
a serial port 96 and communicates with the computer 56 over a data bus via a serial 
15 port 98 and the serial line driver 94. In this embodiment, the DSP 90 is also 

manufactured by Analog Devices under part number ADSP2185M. The serial line 
driver 94 is manufactured by Analog Devices under part number ADM222. 

The master controller 54 and each digital camera 63 follow a 
communication protocol that enables bi-directional communications via a common 
20 serial cable similar to a universal serial bus (USB). The transmission bandwidth is 
divided into thirty-two (32) 16-bit channels. Of the thirty-two channels, six (6) 
channels are assigned to each of the DSPs 84 in the digital cameras 63 and to the DSP 
90 in the master controller 54 and the remaining two (2) channels are unused. The 
master controller 54 monitors the twenty-four (24) channels assigned to the DSPs 84 
25 while the DSPs 84 monitor the six (6) channels assigned to the DSP 90 of the master 
controller 54. Communications between the master controller 54 and the digital 
cameras 63 are performed as background processes in response to interrupts. 

The general operation of the touch system 50 will now be described. 
Each digital camera 63 acquires images of the touch surface 60 within the field of 
30 view of its image sensor and lens assembly 80 at a desired frame rate and processes 
each acquired image to determine if a pointer is in. the acquired image. If a pointer is 
in the acquired image, the image is further processed to determine characteristics of 
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the pointer contacting or hovering above the touch surface 60. Pointer information 
packets (PIPs) including pointer characteristics, status and/or diagnostic information 
are then generated by the digital cameras 63 and the PIPs are queued for transmission 
to the master controller 54. The digital cameras 63 also receive and respond to 
5 command PIPs generated by the master controller 54. 

The master controller 54 polls the digital cameras 63 for PIPs. If the 
PIPs include pointer characteristic information, the master controller 54 triangulates 
pointer characteristics in the PIPs to determine the position of the pointer relative to 
the touch surface 60 in Cartesian rectangular coordinates. The master controller 54 in 

10 turn transmits calculated pointer position data, status and/or diagnostic information to 
the personal computer 56. In this manner, the pointer position data transmitted to the 
personal computer 56 can be recorded as writing or drawing or can be used to control 
execution of application programs executed by the computer 56. The computer 56 
also updates the display output conveyed to the projector 58 so that information 

1 5 presented on the touch surface 60 reflects the pointer activity. 

The master controller 54 also receives commands from the personal 
computer 56 and responds accordingly as well as generates and conveys command 
PIPs to the digital cameras 63 . 

Specifics concerning the processing of acquired images and the 

20 triangulation of pointer characteristics in PIPs will now be described with particular 
reference to Figures 6 to 8. 

Initially, a camera offset angle calibration routine is performed to 
determine the offset angle 8 of each digital camera 63 (see Figure 1 1) so that the 
contact or hover position of a pointer relative to the touch surface 60 can be 

25 accurately determined. Details of the camera offset angle calibration are described in 
Applicants' co-pending U.S. application entitled "Calibrating Camera Offsets to 
Facilitate Object Position Determination Using Triangulation" filed on June 1, 2001, 
the content of which is incorporated herein by reference. 

Following the camera offset angle calibration routine, a surface 

30 detection routine is performed to enhance determination as to whether a pointer is in 
contact with the touch surface 60 at a given point or hovering above thetouch surface. 
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With rectangular coordinates of a pointer in the plane of the touch 
surface 60 accurately known from the camera offset angle calibration, the orientation 
of the touch surface 60 as seen by each digital camera 63 can be determined. This is 
necessary due to the fact that the digital cameras do not just see along the plane of the 
touch surface 60 but also in a direction perpendicular to it. To some degree, each 
digital camera 63 looks downward into the touch surface 60. Figure 14 generally 
shows the shape of the touch surface 60 as seen by a digital camera 63. Because of 
this, it is desired to define a "vertical" coordinate z which describes the touch surface 
location as a function of rectangular coordinates x and y. 

The z coordinate of the pointer can be measured from a digital camera^ 
image, and hence, z coordinates for pointer positions on the touch surface 60 can be 
determined. This vertical calibration becomes a matter of fitting the z coordinate data 
for given rectangular coordinates x and>». The vertical calibration can be described as 
a surface of the form: 

z(x,y) = Ax+By+Cx 2 +'Dy 2 +Exy+F * (0.1) 
Note that if the coefficients C, Z), and E are zero, this becomes a plane. The fit is 
easily computed as equation (0.1) represents a linear least-squares problem. The 
corresponding matrix takes the form: 



3h 



4 y\ 



yl 



A 
B 
C 
D 
E 



In order to fit the rectangular coordinates x and y to the equation (0. 1 ) 
to determine the coefficients A to E, the Moore-Penrose pseudo-inverse method that 
is based on singular value decomposition (SVD) is used to determine a minimum- 
norm least squares solution. 

As will be appreciated, a matrix can always be decomposed in the 
following way: 

A = USV T (0.2) 
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Matrixi4 can have any shape. The matrices U and V are orthogonal 
matrices, meaning that: 

U T U = I = V T V 

The diagonal matrix S is composed entirely of the singular values of 

5 matrix A , which are related to the squares of the eigenvalues of matrix A . The 

importance of the singular value decomposition (SVD) lies in the fact that with it, the 

inverse of matrix A can always be computed. Moreover, it is possible to control this 

inversion when a poorly determined problem is encountered. Consider the system of 

linear equations: i 
v v 

10 Ax = b ! 

whose solution would be: 
x = A-'b 

SVD allows the inverse of matrix A to be written as: 

A- l =VS~ l U T , (0.3) 

15 since both matrices U and Fare orthogonal. In a poorly determined situation, some of 
the singular values will be very small, so that when matrix S" 1 is formed, large values 
will be produced, which is not desirable. In this case, the inverses of the smallest 
singular values are set to zero. This has the effect of eliminating the poorly 
determined part of the solution. For least-squares problems, this is a powerful tool. 
20 The usual normal equations method for least-squares problems is based on solving: 

A T Ax = A T b (0.4) 

in the over-determined case, and solving: 

Ua t (AA t YI (0.5) 

25 in the under-determined case. As will be appreciated, during fitting of the system of 
equations to equation (0.1), the same method is used as is used during determination 
of the camera offset angles 8 . Since the same procedure is used, memory usage and 
processing speed is maintained at desired levels. 

With the coefficients A through E known, the z coordinate for any 

30 given (x,y) point on the touch surface can be calculated and thus, a determination can 
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be made as to whether a pointer is contacting the touch surface 60 or hovering above 

it. 

With the touch system 50 calibrated, during operation each digital 
camera 63 acquires images of the touch surface 60 within its field of view. The 
5 images are acquired by the image and lens assembly 80 at intervals in response to the 
clock signals received from the DSP 84. Each image acquired by the image and lens 
assembly 80 is sent to the FIFO buffer 82. The DSP 84 in turn reads each image from 
the FIFO buffer 82 and processes the image. To avoid processing significant numbers 
of pixels containing no useful information, only a subset of the pixels in the acquired 

10 image are processed as is shown in Figure 9. 

During processing of an image acquired by a digital camera 63, the 
DSP 84 executes a processFrame routine as shown in Figure 6. When an image is 
available for processing (step 120), a check is made to determine if the image has 
been captured for the purpose of adjusting the digital camera 63 (step 122). If the 

1 5 image has been acquired for the purpose of exposure adjustment, an exposureControl 
routine is called (step 124) to adjust the exposure of the digital camera 63. Following 
this, the DSP 84 awaits receipt of the next image available for processing. 

At step 122, if the image has not been captured for the purpose of 
adjusting the exposure of the digital camera 63, a check is made to determine if the 

20 image has been captured for the purpose of replacing a background image (step 126). 
If the image has been acquired for the purpose of background image replacement, a 
captureBackground routine is called (step 128) and the acquired image is used as the 
background image. This is done if a digital camera acquires an image and sends a PIP 
to the master controller indicating that a pointer is in the image when it is actually 

25 noise. Replacing the background image effectively inhibits the digital camera from 
falsely identifying a pointer in future PIPs. Following this, the DSP 84 awaits receipt 
of the next image available for processing. 

At step 126, if the image has not been captured for the purpose of 
background image replacement, a copylCur routine is called by the DSP 84 (step 

30 130). During this routine, the current acquired image is copied into memory and is 
used to update the background image as well as to form a difference image 
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representing the differences between the current acquired image and the background 
image. 

After completion of the copylCur routine, a segmentPointer routine is 
called (step 132) to determine if a pointer is in the acquired image and if so to 
5 determine the location of the pointer relative to the touch surface 60 and whether the 
pointer is in contact with the touch surface 60 or hovering above it. The 
segmentPointer routine 132 also allows changing light conditions to be detected. 
Following the segmentPointer routing 132, the DSP 84 calls a fillPJP routine (step 
134) to place the pointer and light condition information into a PIP for transmission to 
10 the master controller 54. Thereafter, the DSP 84 awaits receipt of the next image 
available for processing. 

Figure 7 illustrates the steps performed by the DSP 84 during 
execution of the segmentPointer routine 132. As can be seen, when the DSP 84 
executes the segmentPointer routine, the DSP 84 calls a findPointer routine to 
1 5 determine if a pointer is in the acquired image and if so, the position of the pointer in 
the current acquired image (step 140). Upon completion of the findPointer routine 
140, the DSP 84 calls an updateBackground routine to update the background image 
thereby to deal with changes in lighting conditions (step 142). 

During execution of the updateBackground routine, the DSP 84 
20 continuously updates the background image using the equation: 

B n+ i(y) = 0-a)B n (y)+aI(iJ) (0.6) 

where: 

B n +i is the new background image; 
B n is the current background image; 
25 I is the current acquired image; 

ij are the row and column coordinates of the background image pixels 

being updated; and 

a is a number between 0 and 1 that indicates the degree of learning that 
should be taken from the current acquired image L The larger the value of a, the 
3 0 faster the background image is updated. 

After the updateBackground routine 142 has been executed, the 
intensity difference between the current acquired image and the background image is 
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calculated by the DSP 84. This information is sent to the master controller 54 to 
enable the master controller to determine if the digital camera 63 needs to be re- 
exposed. This would be required if a drastic change in lighting conditions occurred 
(i.e. environment lighting was switched on or off). When re-exposure of the digital 
5 camera 63 is required, the master controller 54 sends a command PIP to the digital 
camera 63 instructing the digital camera to acquire an image for exposure adjustment. 

Figure 8 illustrates the steps performed by the DSP 84 during 
execution of the findPointer routine 140. As can be seen, when the DSP 84 executes 
the findPointer routine 140, the DSP 84 clears pointer location and pointer tip 

10 parameters x and z respectfully (step 150). Thereafter a vertical intensity histogram is 
built (step 152). During this stage, the difference image representing differences 
between the current image and background image is formed and pixel intensities in 
the difference image are summed by column. In this manner a 640 x 1 vector is 
formed that represents the sum of each column in the 640 x 20 difference image. 

15 Thus, the first element in the 640 x 1 vector represents the sum of the 20 pixels in the 
first column of the 640 x 20 difference image, the second element in the 640 x 1 
vector represents the sum of the 20 pixel in the second column of the 640 x 20 
difference image and so on. Further specifics of this process can be found in the 
article entitled " A smart camera application: DSP - based people detection and 

20 . tracking" authored by V. Cheng et al and published in the SPIE Journal of Electronic 
Imaging July, 2000. 

Following the creation of the vertical intensity histogram at step 152, 
the pointer location parameter x is determined by finding the column in the vertical 
intensity histogram with the highest intensity above a noise threshold (step 154). The 

25 column is used as the center of a region of interest (ROI) to be processed with the c 
width of the ROI being equal to the base of the peak formed by the vertical intensity 
histogram (see Figure 10). If no column has an intensity above the noise threshold, it 
is assumed no pointer is within the acquired image. 

When a pointer location parameter x is determined, the DSP 84 

30 analyses the ROI to determine the pixel row where the pointer tip is located and 
determine whether that row represents a touch surface contact or hover (step 1 56). 
Specifically, the DSP 84 creates a binary mask in the ROI so that white pixels 
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represent the pointer and black pixels represent the background as shown in Figure 
12. From the mask, the medium line of the pointer and the pointer tip location z can 
be easily calculated. 

During the fillPIP routine 134, the DSP 84 uses the pointer and light 
5 condition information acquired during execution of the segmentPointer routine 132 
and creates a PIP to reduce the acquired image to a small set of data thereby to 
provide bandwidth economy. The PIP is in the form of a six (6) word packet, with 
each word in the packet being sixteen (16) bits. The PIP typically takes the form: 



Header 



Data 



Checksum 



10 

The header portion of the PIP is typically sixteen (16) bits and includes 
a determination/source field, a data type field, an image frame number field, a 
sequence number field and a packet number field. The destination/source field 
identifies the PIP destination and the PIP source. If the PIP is generated by the master 

15 controller 54, the destination may be, a single digital camera 63 or all digital cameras. 
The data type indicates whether the PIP relates to pointer information or other 
information such as status and diagnostic information. The image frame number field 
stores a number so that images from each digital camera 63 are processed by the 
master controller 54 in sequence. The sequence number field stores a number that 

20 relates the PIP to other PIPs. The packet number field stores a number identifying the 
packet. 

The data portion of the PIP is typically sixty-four (64) bits and includes 
a pointer ED field, a pointer location parameter field, a pointer tip parameter field, a 
contact state field and a goodness of pointer field. The pointer ID field stores an 

25 identifier for the pointer to allow multiple pointers to be tracked. The pointer location 
parameter field stores the x-value calculated by the DSP 84. The pointer tip parameter 
field stores the z-value calculated by the DSP 84. The contact state field stores a 
value that indicates whether the pointer is in contact, out of contact or possibly in 
contact with the touch surface 60. The goodness of pointer field stores a statistical 

30 value on the likelihood that a detected pointer is real. 
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The checksum portion of the PIP is used to ensure PIP transmission 
integrity. If PIP checksum errors are infrequent, the PIPs exhibiting checksum errors 
are ignored by the destination device. 

Status PIPs that do not relate to pointer information have a different 
5 form then the above-identified described PIPs. For PBPs of this nature, the data 
portion includes an instruction type field, an instruction code field and a data field. 
The instruction type field identifies whether the instruction type is an instruction to be 
performed or a status request. The instruction code field stores the actual instruction 
or status request identifier. The data field stores data that varies depending on the 
10 type of instruction. Examples of status PIPs include frame header PIPs, command 
PIPs and error message PIPs. 

A frame header PIP typically includes the number of pointer PIPs that 
are to follow for a current acquired image with statistics for the current image such as 
intensity variance between the current acquired image and a previous image. A 
1 5 command PIP issued by the master controller 54 may instruct a digital camera to 

adjust one or more of its settings such as exposure or capture an image to be used as a 
new background image. An error PIP may pass an error condition from a digital 
camera 63 to the master controller 54 for storage in an error log. 

Each digital camera 63 processes each image it acquires in the manner 
20 described above in response to each clock signal generated by its DSP 84. The PIPs 
created by the DSPs 84 are only sent to the master controller 54 when the digital 
cameras 63 are polled by the master controller 54. 

When the master controller 54 polls the digital cameras 63, frame sync 
pulses are sent to the digital cameras 63 to initiate transmission of the PIPs created by 
25 the DSPs 84. Upon receipt of a frame sync pulse, each DSP 84 transmits the PIP to c 
the master controller 54 over the data bus. The PIPs transmitted to the master 
controller 54 are received via the serial port 96 and auto-buffered into the DSP 90. 

After the DSP 90 has polled the digital cameras 63 and has received 
PIPs from each of the digital cameras 63 that include pointer information, the DSP 90 
30 processes the PIPs using triangulation to determine the location of the pointer relative 
to the touch surface 60 in (x,y) coordinates. Specifically, the PIPs from pairs of 
digital cameras 63 are processed using triangulation. 
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Figure 1 1 shows that two angles and 0^ are needed to 
triangulate the position (x 0 , y 0 ) of a pointer relative to the touch screen 60. The PIPs 
generated by each digital camera 63 include a number 0 (see Figure 12) identifying 
the median line or tip of the pointer. When the master controller 54 receives a PIP 
5 from a digital camera 63, the master controller uses the number representing the 

median line or tip of the pointer and the field of view of the digital camera to calculate 
an angle <f> C am using the equation: 




(0. 



10 where: 

x is the number representing the median line or tip of the pointer; and 

a is the total length enclosed by the field of view (FOV) of the digital 
camera at a distance from the camera. 

The calculated angle $ cam is equal to the angle formed between the 
15 extremity of the field of view extending beyond the designated peripheral edge of the 
touch surface 60 of the digital camera 63 that generated the PIP and a line extending 
from the optical axis of the digital camera that intersects the pointer within the 
acquired image. Preferably, the extremity of the field of view extends beyond the 
designated peripheral edge (i.e. in this case the x-axis) of the touch surface 60 within 
20 the field of view by a known amount. However, in almost all cases the angular offset 
$ cam scan of digital camera 63 is different and unknown. 

Once the master controller 54 calculates the angle 0^ , the master 
controller 54 uses the camera offset angle 5^ determined during the camera offset 
calibration to adjust the angle $ m . With the two angles available and with the 
25 angles 0^ adjusted, the master controller 54 uses the angles ^ to determine the 
position of the pointer relative to the touch surface 60 using triangulation. 

In this embodiment, since the touch screen 52 includes four digital 
cameras 63, six pairs of digital cameras can be used for triangulation. The following 
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discussion describes how a pointer position is determined by triangulation for each 
pair of the digital cameras 63 . 

In order to determine a pointer position using the PDPs received from 
the digital cameras 63 along the left side of the touch screen 52, the following 
5 equations are used to determine the (jc 0 , y 0 ) coordinates of the pointer position given 
the angles ^ 0 and fa for the upper and lower digital cameras: 



* >iAan(^ 0 )+tanfa) ( °" 8) 



10 ''-tanGO+tanfo) 



where: 

h is the height of the touch screen 52 i.e. the vertical distance from 
digital camera focal point-to-focal point; 
15 w is the width of the touch screen 52 i.e. the horizontal distance from 

digital camera focal point-to-focal point; and 

fa is the angle with respect to the horizontal, measured using digital 

camera i and equation (0.7). 

For the digital cameras 63 along on the right side of the touch screen 
20 52, the following equations are used to determine the (x 0 , y 0 ) coordinates of the 

pointer position given the angles fa and fa for the upper and lower digital cameras: 



x 0 =1-A X _1 (0.10) 

0 w tan($y+tan($y 



25 t^tth (011) 

tan(^ 2 )+tan(^ 3 ) 
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The similarity between equations (0.8) and (0.10), i.e. equation (0.10) 
= 1 - equation (0.8) once angles ff> 2 and ^ 3 have been substituted into equation (0.8) 
for angles 0, and <j> 2 respectively should be apparent. Equations (0.9) and (0.11) are 
related in a similar manner. 
5 In order to determine a pointer position using the digital cameras 63 

along the bottom of the touch screen 52, the following equations are used to 
determine the (x Q9 y 0 ) coordinates of the pointer position given the angles $ 0 and ^ 3 
for bottom left and bottom right digital cameras: 

tan(&)+tan(&) 



*-* x *t4)™ (0 . 13) 



= — xx 0 xtan(^ 0 ) 
h 



In order to determine a pointer position using the digital cameras 63 
15 along the top of the touch screen 52, the following equations are used to determine the 
(x 0 ,y 0 ) coordinates of the pointer position given the angles fa and </> 2 for the top left 

and top right digital cameras: 

*°= rffk) (0.14) 
tanfo)+tanp 4 

20 



,0 h tan(^)+tan(^ 2 ) 



(0.15) 



= l-yxx 0 xtani 
h 
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The similarity between equations (0.12) and (0.14), i.e. equation (0.14) 
= equation (0.12) once angles fa and fa have been substituted into equation (0.12) 
for angles fa and fa should be apparent. Equations (0.13) and (0.15) have the 
following relationship: equation (0.15) = 1 - equation (0.13) once angles fa and fa 
have been substituted into equation (0.13) for angles fa and fa respectively. 

In order to determine a pointer position using the digital cameras 63 
across the bottom left to top right corner diagonal, the following equations are used to 
determine the (x 0 ,y 0 ) coordinates of the pointer position given the angles famd fa 
for bottom left and top right digital cameras: 



•tan(&) 



0 tam> 0 )-tam> 2 ) 



l-£-tam> 2 ) 

y Q = : A .__/^ * tan fo>) (°- 17 > 



tan(0 n )-tanl 



1 5 In order to determine a pointer position using the digital cameras 63 

across the bottom right to top left diagonal, the following equations are used to 
determine the (x 0 ,y 0 ) coordinates of the pointer position given the angles fa and fa 
for the bottom right and top left digital cameras: 

--tan(&) 

zu ^ tan(^,)-tan(^ » 



^0=1" 



l-£-taiu> 3 ) 

ft 

tan(^)-tan(^ 3 ) 



:tan((*,) 



(0.19) 
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The similarity between equations (0.16) and (0.18), i.e. equation (0.18) 
= equation (0.16) once angles <f> x and <f> z have been substituted into equation (0.16) for 
angles and $* 2 should be apparent. Equations (0.1 7) and (0.1 9) have the following 
relationship: equation (0.19) = 1 - equation (0.17) once angles ^, and ^ 3 have been 
5 substituted into equation (0.17) for angles ^ 0 and 0 2 respectively. 

As will be appreciated, the above equations generate the coordinates 
x 0 and y 0 on a scale of [0, 1]. Therefore, any appropriate coordinate scale can be 
reported by multiplying x 0 and y 0 by the maximum X and maximum Y values 
respectively. 

10 In the present embodiment, the DSP 90 calculates the pointer position 

using triangulation for each digital camera pair excluding the diagonal pairs. The 
resulting pointer positions are then averaged and the resulting pointer position 
coordinates are queued for transmission to the personal computer 56 via the serial port 
98 and the serial line driver 94. 

1 5 With the (x,y) position of a pointer known by triangulation, using the 

coefficients A to E calculated during the surface detection calibration, the z 
coordinate corresponding to the (x,y) position can be determined using equation (0. 1). 
Calculating the z coordinate and comparing the z coordinate with the z parameter in 
the PIP provides an indication as to whether the pointer is hovering above the touch 

20 surface 60 or is in actual contact with the touch surface. 

If desired, pointer velocity v and angle can be calculated by the DSP 90 
as shown in Figure 13. The velocity of the pointer is calculated by examining the 
changes in the z-position (or x-intercept) of the pointer in successive PIPs and 
knowing the camera frame rate. For example, if the camera frame rate is 200 frames 

25 per second and the z-position changes by 1 pixel row per frame, the pointer velocity is 
200 pixels per second. 

The angle of the pointer can be determined due to the fact that the PIP 
includes the x-intercept at pixel rows 0 and 19 of the median line. Since the x 
distance (the difference between x-intercepts) and the y distance (the number of pixel 

30 rows) are known, all of the information necessary to calculate the pointer angle is 
available. 
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10 



15 



20 



25 



If desired, a Kalman filter (essentially a recursive least-squares 
method) may be used to effectively "track" the pointer when it is within a certain 
distance of the touch surface 60. To do this, it is necessary to define a system 
equations or model to be used in the filter. Since the master controller 54 is able to 
provide both the position z and velocity v of the pointer, the following description can 
be used: 

z = z 0 +vt 

V = V 

The second of these equations is required as the filter has to know what to do with the 
velocity, and also since both z and v are measurable. Define the state vector as: 

To relate the state of the system at two successive times n and «+l, write the system 
equations as a matrix difference equation: 



z 




'1 diJz 
















V 


ft+1 


0 l}v_ 







or in matrix notation, 

Here, dt denotes the time interval between successive time steps. Also introduced 
here on the RHS is the "process noise" term. It is purely formal, but part of the 
Kalman filter method. It is also necessary to specify how a measurement is 
introduced into the procedure. This is done via the matrix equation: 

z n =Hx„+w 

where z„ is a measurement of position and velocity, H is a "measurement matrix" 
which is taken to be an identity matrix, x n is the state vector and w is measurement 
noise. Essentially, it is assumed that the measurements are noisy versions of the state 
vector. It is also necessary to define a covariance matrix associated with w. If the 
measurement error in z is 0.5 pixel, then the covariance matrix is: 

~1 0~ 



0 1 



A similar matrix Q is required for the process noise introduced above, but as it is 
somewhat arbitrary, it may be treated as a tuning parameter for the filter. In this 
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example, the matrix Q is taken to be an identity matrix multiplied by a factor of order 
unity or less. With the above established, there is sufficient information to start the 
filter process. The first (prediction) step is: 

** + i(-) = ^**(+) 

Here, the (-) notation implies that a measurement has not yet been made while (+) 
does (but in this case the (+) refers to the previous step). Also, the matrix equation for 
matrix P predicts a covariance matrix. The next step is the filter gain computation: 

K k ^P k ^)Hl^ k P k (r)Hl^R k Y 
Once a measurement is made, the state estimate and its covariance can be updated: 

**(+) = **(-)+'*fc-^H] 

It is this estimate of the state x that is used to determine whether or not contact with 
the touch surface has occurred. Note here that the matrices H and R are both constant 
with time, and that only matrices K and P change (in fact, P approaches a constant 
matrix). An additional simplification occurs in that there is no control process 
15 involved. 

The results of a Matlab simulation of a Kalman filter using a set of 
measurements representing a pointer approaching the touch surface 60 at a constant 
velocity was performed. Figures 15 and 16 illustrate the simulation, with a time step 
dt of 0. 1 sec and a measurement precision of 0.5 pixel. The open symbols represent 

20 the data, and the lines the state estimate from the Kalman filter. Clearly, the state 
estimate follows the data quite well. 

A second Matlab simulation was performed to take into account both 
vertical (z) and horizontal (x) motion of a pointer. This simulation is basically two 
similar Kalman filters operating together in a "parallel" fashion. The formulation is 

25 exactly the same, except twice the number of variables need to be considered. Figures 
17a to 1 7d show the results of the simulation and represent movement of a pointer 
towards the touch surface 60 at constant velocity and at a slowly-varying x position 
(i.e. the person's hand is unsteady). 
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Although the touch system 50 has been described as including a 
projector to present images on the touch screen, those of skill in the art will appreciate 
that this is not required. The touch screen 52 may be transparent or translucent and 
placed over a display unit so that the display presented on the display unit is visible 
5 through the touch screen. Also, the touch screen need not be a rectangular sheet of 
material bordered by a frame. The touch screen may in fact be virtually any surface 
within overlapping fields of view of two or more digital cameras. 

Also, although the touch system 50 is described as including a master 
controller separate from the digital cameras, if desired one of the digital cameras can 
10 be conditioned to function as both a camera and the master controller and poll the ' | 
other digital cameras for PIPs. In this case, it is preferred that the digital camera 
functioning as the master controller includes a faster DSP 84 than the remaining 
digital cameras. 

In addition, although the surface detection routine is described as 

1 5 determining the coefficients A to E to be used with equation (0.1) to calculate the z 
coordinates of the pointer at a given point (x,y) relative to the touch screen, during the 
surface detection routine, the master controller 54 can be programmed to calculate a z 
coordinate for unique (x,y) regions of the touch surface and store the z coordinates in 
a look-up table (LUT). In this instance; when a pointer appears in images captured by 

20 the digital cameras and the (x,y) position of the pointer relative to the touch surface is 
determined, a decision can be made as to whether the pointer is in contact with the 
touch surface by comparing the z coordinate in the LUT corresponding with the (x,y) 
region in which the pointer is located, with the pixel row of the image sensor and lens 
assembly at which the pointer tip is located. 

25 As described above, the master controller 54 calculates or looks up t^e 

z coordinates of the touch surface for each digital camera and compares the z 
coordinates with the pointer tip location z to determine if the pointer is in actual 
contact with the touch surface. However, those of skill in the art will appreciate that 
the DSPs 84 in the digital cameras may include image processing software to 

30 determine if the pointer is in actual contact with the touch surface. This image 

processing can be preformed in conjunction with or instead of the master controller 
pointer contact determination. 
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Although a preferred embodiment of the present invention has been 
described, those of skill in the art will appreciate that variations and modifications 
may be made without departing from the spirit and scope thereof as defined by the 
appended claims. 
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WHAT IS CLAIMED IS : 

1 . A camera-based touch system comprising: 

at least two cameras associated with a passive touch surface and 
5 having overlapping fields of view encompassing said touch surface, said at least two 
cameras acquiring images of said touch surface from different locations and 
generating image data; and 

a processor receiving and processing image data generated by said at 
least two cameras to determine the location of a pointer relative to said touch surface 
10 when said pointer is captured in images acquired by said at least two cameras. 

2. A touch system according to claim 1 wherein said at least two cameras 
are digital cameras having fields of view looking generally along the plane of said 
touch surface. 

15 

3 . A touch system according to claim 2 wherein the image data generated 
by each digital camera includes a pointer median line x and a pointer tip location z. 

4. A touch system according to claim 3 wherein each of said digital 

20 cameras includes a pixel array having selectable pixel rows, pixel intensities of pixels 
in said selectable pixel rows being used during generation of said image data. 

5. A touch system according to claim 4 wherein pixel intensities of pixels 
in a region of interest within said selectable pixel rows are used during generation of 

25 said image data. 

6. A touch system according to claim 5 wherein each of said digital 
cameras packages said image data into packets for transmission to said processor to 
provide bandwidth economy. 

30 



7. A touch system according to claim 6 wherein each of said digital 

cameras includes a CMOS image sensor and a digital signal processor, said digital 
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signal processor receiving image output from said image sensor and executing a find 
pointer routine to determine if a pointer is in each image acquired by said digital 
camera, and if so the median line x of said pointer. 

5 8. A touch system according to claim 7 wherein during said find pointer 

routine, said digital signal processor builds a vertical intensity histogram including 
columns of pixel intensities representing differences between an acquired image and a 
background image, the column of said vertical intensity histogram having the largest 
pixel intensity above a threshold level being used to define the center of said region of 
10 interest, the width of said region of interest being defined by columns of said vertical 
intensity histogram on opposite sides of the column defining the center of said region 
of interest that have pixel intensities greater than said threshold level. 

9. A touch system according to claim 8 wherein the digital signal 

1 5 processor of each digital camera analyses the pixels in said region of interest to locate 
the pixel row where the pointer tip is located and determine said pointer tip location z. 

10. A touch system according to claim 9 wherein the digital signal 
processor of each digital camera creates a mask in said region of interest with white 

20 pixels representing said pointer and black pixels representing background to enable 
said median line x and pointer tip location to be calculated. 

11. A touch system according to claim 10 wherein the digital signal 
processor of each digital camera further executes an update background image routine 

25 to update the background image after each image is acquired. 

12. A touch system according to claim 1 1 wherein during said update 
background image routine, the digital signal processor of each digital camera uses the 
equation: 

30 B n+ i (ij) - (1-a) B n (i,j) + al (ij) 

where: 

B n +i is the new background image; 
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B n is the current background image; 
I is the current acquired image; 

i j are the row and column coordinates of the background image pixels 
being updated; and 

5 a is a number between 0 and 1 that indicates the degree of learning that 

should be taken from the current acquired image I 

13. A touch system according to claim 12 wherein the digital signal 
processor of each digital camera further determines the differences between each 

10 acquired image and the background image to detect changing light conditions. 

14. A touch system according to claim 13 wherein each digital camera 
transmits light condition information to said processor, said processor using said light 
condition information to adjust the exposure of each said digital camera. 

15 

15. A touch system according to claim 14 wherein the digital signal 
processor of each digital camera further executes a create packet routine to package 
said image data and light condition information into said packets. 

20 16. A touch system according to claim 3 wherein for image data received 

from each digital camera, said processor calculates an angle ^ C am using the equation: 




25 where: 

x is the number representing the median line or tip of the pointer; and 
a is the total length enclosed by the field of view (FOV) of the digital camera at a 
distance from the camera; said processor using the calculated angles to determine the 
position of the pointer relative to said touch surface using triangulation. 
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17. A touch system according to claim 16 wherein said calculated angles 
are adjusted to take into account digital camera offsets prior to determination of said 
pointer position. 

5 

18. A touch system according to claim 17 wherein said digital camera 
offsets are determined during execution of a digital camera calibration routine. 

19. A touch system according to claim 17 including at least three digital 
10 cameras, said processor determining the pointer position using triangulation for pairs 

of digital cameras and averaging the determined pointer positions. 

20. A touch system according to claim 19 wherein said processor further 
executes a touch surface determination routine to calculate the orientation of the touch 

1 5 surface as seen by each digital camera to. determine when the pointer is in contact 
with said touch surface and when said pointer is hovering above said touch surface. 

21 . A touch system according to claim 20 wherein said processor further 
calculates the velocity of a pointer as said pointer moves toward said touch surface 

20 within the fields of view of said digital cameras. 

22. A touch system according to claim 20 wherein said processor tracks 
said pointer within the fields of view of said digital cameras. 

25 23. A touch system according to claim 21 wherein said processor tracks 

said pointer using at least one Kalman filter. 

24. A touch system according to claim 19 further including a computer 

coupled to said processor, said computer receiving said pointer location from said 
30 processor. 



25. 



A camera-based touch system comprising: 
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a generally rectangular passive touch surface on which contacts are 
made using a pointer; 

a digital camera mounted adjacent each corner of said touch surface, 
said digital cameras having overlapping fields of view encompassing said touch 
5 surface, said digital cameras acquiring images of said touch surface and generating 
image data that includes the median line x and pointer tip Ideation z of a pointer when 
said pointer is captured in images acquired by said digital cameras; and 

a processor receiving and processing image data generated by said 
digital cameras to determine the location of said pointer relative to said touch surface 
10 and whether said pointer is in contact with said touch surface. 

26. A touch system according to claim 25 wherein each of said digital 
cameras includes a pixel array with selectable pixel rows, pixel intensities of pixels in 
said selectable pixel rows being used during generation of said image data. 

27. A touch system according to claim 26 wherein pixel intensities of 
pixels in a region of interest within said selectable pixel rows are used during 
generation of said image data. 

20 28. A touch system according to claim 27 wherein each of said digital 

cameras includes a CMOS image sensor and a digital signal processor, said digital 
signal processor receiving image output from said image sensor and executing a find 
pointer routine to determine if a pointer is in each image acquired by said digital 
camera, and if so the median line x of said pointer. 

25 

29. A touch system according to claim 28 wherein during said find pointer 

routine, said digital signal processor of each digital camera builds a vertical intensity 
histogram including columns of pixel intensities representing differences between an 
acquired image and a background image, the column of said vertical intensity 
30 histogram having the largest pixel intensity above a threshold level being used to 
define the center of said region of interest, the width of said region of interest being 
defined by columns of said vertical intensity histogram on opposite sides of the 
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column defining the center of said region of interest that have pixel intensities greater 
than a threshold level. 

30. A touch system according to claim 29 wherein the digital signal 

5 processor of each digital camera analyses the pixels in said region of interest to locate 
the pixel row where the pointer tip is located and determine said pointer tip location z. 

31. A touch system according to claim 30 wherein the digital signal 
processor of each digital camera creates a mask in said region of interest with white 

10 pixels representing said pointer and black pixels representing background to enable 
said median line x and pointer tip location to be calculated. 

32: .A touch system according to claim 3 1 wherein the digital signal 

processor of each digital camera further executes an update background image routine 

i 

15 to update the background image after each image is acquired. 

33 . A touch system according to claim 32 wherein during said update 

background image routine, the digital signal processor of each digital camera uses the 
equation: 

20 B^ 1 (U) = G-a)B n (y)+ al(y) 

where: 

B n +i is the new background image; 

B n is the current background image; * 
I is the current acquired image; 
25 ij are the row and column coordinates of the background image pixels 

being updated; and 

a is a number between 0 and 1 that indicates the degree of learning that 
should be taken from the current acquired image I. 

30 34. A touch system according to claim 33 wherein the digital signal 

processor of each digital camera further determines the differences between each 
acquired image and the background image to detect changing light conditions. 
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35. A touch system according to claim 34 wherein each digital camera 
transmits light condition information to said processor, said processor using said light 
condition information to adjust the exposure of each said digital camera. 

5 

36. A touch system according to claim 35 wherein the digital signal 
processor of each digital camera further executes a create packet routine to package 
said image data and light condition information into packets for transmission to said 
processor. 

0 

37. A touch system according to claim 26 wherein for image data received 
from each digital camera, said processor calculates an angle <j> C am using the equation: 




15 

where: 

x is the number representing the median line or tip of the pointer; and 
a is the total length enclosed by the field of view (FOV) of the digital camera at a 
distance from the camera; said processor using the calculated angles to determine the 
20 position of the pointer relative to said touch surface using triangulation. 

38. A touch system according to claim 37 wherein said calculated angles 
are adjusted to take into account digital camera offsets prior to determination of said 
pointer position. 

25 

39. A touch system according to claim 37 wherein said processor further 
executes a touch surface determination routine to calculate the orientation of the touch 
surface as seen by each digital camera to determine when the pointer is in contact 
with said touch surface and when said pointer is hovering above said touch surface. 
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40. A touch system according to claim 39 wherein said processor further 

calculates the velocity of a pointer as said pointer moves toward said touch surface 
within the fields of view of said digital cameras. 

5 

41 . A touch system according to claim 40 wherein said processor track 

said pointer within the fields of view of said digital cameras. 

42. A touch system according to claim 41 wherein said processor tracks 
10 said pointer using at least one Kalman filter. 

43. A touch system according to claim 26 wherein said touch surface is 
bordered by a frame and wherein each of said digital cameras is mounted on a frame 
assembly positioned at a corner of said touch surface that is coupled to a frame, each 

1 5 digital camera being oriented so that the field of view thereof looks downward and 
generally along the plane of said touch surface. 

44. A touch system according to claim 26 further including a computer 
coupled to said processor, said computer receiving pointer location information from 

20 said processor and using said pointer location information to update an applications 
program executed thereby. 

45. A touch system according to claim 44 wherein computer display 
information is presented on said touch surface. 

25 

46. A method of detecting the position of a pointer relative to a touch 
surface comprising the steps of: 

acquiring images of said touch surface from different locations using 
cameras having overlapping fields of view and generating image data; and 
30 processing said image data to detect the existence of a pointer within 

said acquired images and to determine the location of said pointer relative to said 
touch surface. 



WO 02/03316 



PCT/CA01/00980 



-35- 



47. The method of claim 46 wherein during the processing step, the 

location of said pointer relative to said touch screen is determined using triangulation. 

5 48. The method of claim 47 wherein during said processing step, the 

images are processed to determine when said pointer is in contact with said touch 
surface and when said pointer is hovering above said touch surface. 

49. The method of claim 48 wherein said processing step further includes 

1 0 the step of tracking the pointer as the pointer approaches the touch surface. 
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